Source | # of sentences | Average logarithmic rank |
---|---|---|
http://am.wikipedia.org/wiki/ዋና_ከተማ | 11 | 5.92 |
http://am.wikipedia.org/wiki/የኩሽ_መንግሥት | 12 | 6.04 |
http://am.wikipedia.org/wiki/ዘሃራ | 14 | 6.45 |
http://am.wikipedia.org/wiki/አሹር_(ከተማ) | 21 | 6.48 |
http://am.wikipedia.org/wiki/ዙር_ግልባጭ | 24 | 6.48 |
http://am.wikipedia.org/wiki/ኦሬክ | 12 | 6.48 |
http://am.wikipedia.org/wiki/ደማስቆ | 15 | 6.49 |
http://am.wikipedia.org/wiki/ኡር-ናሙ | 13 | 6.54 |
http://am.wikipedia.org/wiki/ሰብታ | 12 | 6.56 |
http://am.wikipedia.org/wiki/ፖድጎሪጻ | 11 | 6.56 |
http://am.wikipedia.org/wiki/አልፍ | 15 | 6.56 |
http://am.wikipedia.org/wiki/የባንጯን_ውግያ | 12 | 6.61 |
http://am.wikipedia.org/wiki/ማህፈድ | 11 | 6.62 |
http://am.wikipedia.org/wiki/ጋውት | 13 | 6.63 |
http://am.wikipedia.org/wiki/ቮላፒውክ | 11 | 6.63 |
http://am.wikipedia.org/wiki/ፓሪስ | 13 | 6.64 |
http://am.wikipedia.org/wiki/ኤአናቱም | 14 | 6.64 |
http://am.wikipedia.org/wiki/የአሦር_ነገሥታት_ዝርዝር | 12 | 6.66 |
http://am.wikipedia.org/wiki/ሀኖይ | 13 | 6.66 |
http://am.wikipedia.org/wiki/የፈርዖኖች_ዝርዝር | 33 | 6.67 |
http://am.wikipedia.org/wiki/የኢትዮጵያ_ነገሥታት | 24 | 6.67 |
http://am.wikipedia.org/wiki/ቤት | 11 | 6.68 |
http://am.wikipedia.org/wiki/ጥግ | 14 | 6.68 |
http://am.wikipedia.org/wiki/የአሜሪካ_ሕገ_መንግሥት_22ኛ_ማሻሻያ_አንቀጽ | 11 | 6.69 |
http://am.wikipedia.org/wiki/ቱዊስኮን | 14 | 6.70 |
http://am.wikipedia.org/wiki/ፓርጦሎን | 17 | 6.70 |
http://am.wikipedia.org/wiki/2_ሰኑስረት | 16 | 6.70 |
http://am.wikipedia.org/wiki/ካራን | 14 | 6.71 |
http://am.wikipedia.org/wiki/3_አመነምሃት | 12 | 6.72 |
http://am.wikipedia.org/wiki/ማኑስ | 12 | 6.72 |

Source | # of sentences | Average logarithmic rank |
---|---|---|
http://am.wikipedia.org/wiki/ዕንቁጣጣሽ | 54 | 8.88 |
http://am.wikipedia.org/wiki/አስርቱ_ቃላት | 19 | 8.83 |
http://am.wikipedia.org/wiki/መሐረቤን_ያያችሁ | 17 | 8.71 |
http://am.wikipedia.org/wiki/አዳዲስ_ቀልድ | 12 | 8.68 |
http://am.wikipedia.org/wiki/ኣስያ_ቢንት_መህዙም | 34 | 8.53 |
http://am.wikipedia.org/wiki/ውርዴ_ወይም_ፈንጣጣ | 11 | 8.52 |
http://am.wikipedia.org/wiki/«የሰብዓዊ_መብት_አቀፋዊ_መግለጽ» | 43 | 8.50 |
http://am.wikipedia.org/wiki/ጸጋዬ_ገብረ_መድህን | 32 | 8.40 |
http://am.wikipedia.org/wiki/ጥምቀት | 16 | 8.37 |
http://am.wikipedia.org/wiki/የእቴጌ_ጣይቱ_ደብዳቤዎች | 15 | 8.35 |
http://am.wikipedia.org/wiki/ጋብቻ | 41 | 8.31 |
http://am.wikipedia.org/wiki/የዳግማዊ_ምኒልክ_ደብዳቤዎች | 49 | 8.30 |
http://am.wikipedia.org/wiki/እንቆቅልሽ | 33 | 8.26 |
http://am.wikipedia.org/wiki/ሼህ_ሁሴን_ጅብሪል | 16 | 8.25 |
http://am.wikipedia.org/wiki/ደብረ_ታቦር_(ዓመት_በዓል) | 24 | 8.21 |
http://am.wikipedia.org/wiki/ሆሣዕና_በዓል | 12 | 8.19 |
http://am.wikipedia.org/wiki/ሊኑክስ | 14 | 8.19 |
http://am.wikipedia.org/wiki/አሬን_ስበላ_ከረምሁ_(1)(2) | 23 | 8.17 |
http://am.wikipedia.org/wiki/እንደምን_አደራችሁ | 15 | 8.16 |
http://am.wikipedia.org/wiki/አው_ባድር | 16 | 8.15 |
http://am.wikipedia.org/wiki/ምልጃ | 31 | 8.15 |
http://am.wikipedia.org/wiki/ፐርል_በክ | 15 | 8.15 |
http://am.wikipedia.org/wiki/ሰሎሞን_ተካልኝ | 16 | 8.15 |
http://am.wikipedia.org/wiki/መርየም_የእየሱስ_(አ.ሰ)_እናት | 37 | 8.14 |
http://am.wikipedia.org/wiki/የዶሮ_ጉንፋን | 11 | 8.13 |
http://am.wikipedia.org/wiki/ቤሲክ_(BASIC) | 44 | 8.12 |
http://am.wikipedia.org/wiki/ውክፔዲያ | 69 | 8.11 |
http://am.wikipedia.org/wiki/እማሆይ_ገላነሽ_አዲስ | 68 | 8.10 |
http://am.wikipedia.org/wiki/አጥናፍሰገድ_ኪዳኔ | 22 | 8.10 |
http://am.wikipedia.org/wiki/ጳውሎስ | 14 | 8.08 |
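In this subsection, average word length is replaced by the average logarithmic word rank. The logarithm of the rank is used so that rare words, which have high ranks, are penalized only moderately instead of dominating the average. The first table lists the 30 sources with the lowest average logarithmic rank; the second lists the 30 sources with the highest.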
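The effect of taking the logarithm can be illustrated with a minimal sketch. The ranks below are invented for illustration and do not come from the corpus; the point is only that a single very rare word dominates the plain mean but shifts the logarithmic mean moderately.

```python
import math

# Hypothetical word ranks for one short text:
# four fairly common words and one very rare word.
ranks = [120, 150, 300, 450, 200_000]

# Plain arithmetic mean: dominated by the single rare word.
mean_rank = sum(ranks) / len(ranks)

# Mean of the natural logarithms: the rare word raises it only moderately.
mean_log_rank = sum(math.log(r) for r in ranks) / len(ranks)

print("mean rank:", round(mean_rank, 1))
print("mean log rank:", round(mean_log_rank, 2))
print("rank equivalent to the mean log rank:", round(math.exp(mean_log_rank)))
```

Converting the logarithmic mean back with `exp` gives a rank of the same order as the common words, whereas the plain mean is of the order of the rare word.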
First table:
SELECT source,
       COUNT(DISTINCT i_s.s_id) AS cnt_s,
       ROUND(AVG(LOG(w.w_id - 100)), 2) AS av
FROM sources so, inv_so i_s, inv_w i, words w
WHERE so.so_id = i_s.so_id
  AND i_s.s_id = i.s_id
  AND i.w_id = w.w_id
  AND w.w_id > 100
GROUP BY source
HAVING cnt_s > 10
ORDER BY av
LIMIT 30;
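The logic of this query can also be sketched outside the database. The following Python function is an illustrative approximation, not the code used to produce the tables: the name `average_log_rank` and the input format (one `(source, sentence_id, word_id)` tuple per word occurrence, as the joined rows would look) are assumptions. It mirrors the query by skipping word IDs up to 100, averaging the natural logarithm of `word_id - 100` per source, and keeping only sources with more than 10 sentences.

```python
import math
from collections import defaultdict


def average_log_rank(rows, offset=100, min_sentences=10):
    """Return (source, sentence_count, avg_log_rank) tuples, lowest average first.

    rows: iterable of (source, sentence_id, word_id), one per word occurrence
    (hypothetical input format standing in for the joined tables).
    """
    sentences = defaultdict(set)   # distinct sentence IDs per source
    log_sum = defaultdict(float)   # sum of log(word_id - offset) per source
    count = defaultdict(int)       # number of word occurrences per source

    for source, s_id, w_id in rows:
        if w_id <= offset:
            continue  # same filter as "w.w_id > 100" in the query
        sentences[source].add(s_id)
        log_sum[source] += math.log(w_id - offset)
        count[source] += 1

    result = [
        (source, len(sentences[source]), round(log_sum[source] / count[source], 2))
        for source in count
        if len(sentences[source]) > min_sentences  # "having cnt_s > 10"
    ]
    return sorted(result, key=lambda r: r[2])      # "order by av"
```

Taking the first 30 entries of the returned list corresponds to the `LIMIT 30` of the query; sorting in descending order instead would give the sources with the highest averages.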